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Abstract. Sublinear time algorithms represent a new paradigm 
in computing, where an algorithm must give some sort of an an- 
swer after inspecting only a small portion of the input. The most 
typical situation where sublinear time algorithms are considered 
is property testing. There are several interesting contexts where 
one can test properties in sublinear time. A canonical example is 
graph colorability. To tell that a given graph is not fc-colorable, 
it is often sufficient to inspect just one vertex with incident edges: 
if the degree of a vertex is greater than k, then the graph is not 
fc-colorable. 

It is a challenging and interesting task to find algebraic prop- 
erties that could be tested in sublinear time. In this paper, we 
address several algorithmic problems in the theory of groups and 
semigroups that may admit sublinear time solution, at least for 
"most" inputs. 



1. Introduction 

Typically, to give some information about an input, an algorithm 
should at least "read" the entire input, which takes linear time in 
"length", or complexity, of the latter. Thus, linear time was usu- 
ally considered the "golden standard" of achievement in computational 
complexity theory. 

Sublinear time algorithms represent a new paradigm in computing, 
where an algorithm must give some sort of an answer after inspecting 
only a small portion of the input. Given that reading some data takes 
too long, it is natural to ask what properties of the data can be detected 
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by sublinear time algorithms that read only a small portion of the 
data. Sublinear time algorithms for decision problems are examples of 
property testing algorithms. 

In broad terms, property testing is the study of the following class 
of problems: 

Given the ability to perform local queries concerning a 
particular object (e.g., a graph, or a group element), 
the task is to determine whether or not the object has a 
specific property. The task should be performed by in- 
specting only a small (possibly randomly selected) part 
of the whole object. 

Often, a small probability of failure is allowed, especially when effi- 
ciency is more important than accuracy; this makes a difference with 
"usual" decision algorithms that have to give correct answers for all in- 
puts. (By "failure" here we mean a situation where an algorithm cannot 
give a conclusive answer, but we do not allow an algorithm to give a 
wrong answer.) In this sense, one of the ideas behind using sublinear 
time algorithms is similar to that of using genericity, i.e., assessing 
complexity of an algorithm on "most" inputs, see e. g. [9l 110^ [TT]. 

Property testing algorithms offer several benefits: they save time, 
are good in settings where some errors are tolerable and where the 
data is constantly changing, and can also provide a fast check to rule 
out bad inputs. An additional motivation for studying property testing 
is that this area is abundant with fascinating combinatorial problems. 
Property testing has recently become an active research good 
recent survey is |17j . 

In this paper, we address several algorithmic problems in the theory 
of groups and semigroups that may admit sublinear time solution, at 
least for "most" inputs. One of these problems is a special case of 
the well-known Whitehead's problem: given two elements of a free 
group F, find out whether or not one of them can be taken to the 
other by an automorphism of F. This problem was solved long time 
ago by Whitehead himself, but the complexity of the solution is still 
a subject of active research. It is not hard to show, for example, that 
those elements (represented by freely reduced words) which cannot be 
taken to a free generator (i.e., non-primitive elements) can be detected 
by a sublinear (with respect to the length of an input element) time 
algorithm with a negligible probability of failure; see our Section [2j 
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Another problem that we consider is the word problem. It is fairly 
easy to show that testing sublinear-length subwords of a given (freely 
reduced) word g cannot help in deciding whether or not g = 1 hi G un- 
less G is a free group because one has to at least test a subword of length 
about ^\g\- However, with semigroups the situation is different, so we 
want to find (natural) examples of semigroups where the word prob- 
lem admits a sublinear time solution for "most" inputs. One potential 
source of such examples is "positive monoids" associated with groups, 

1. e., monoids generated by group generators, but not their inverses. We 
address this problem in Section U] for positive monoids associated with 
free nilpotent group, with Thompson's group F, and with braid groups. 
It turns out that of these positive monoids, only those associated with 
braid groups admit sublinear-time detecting of inequality at least for 
some pairs of words. 

2. Background: sublinear time algorithms in graph theory 

There are several interesting contexts where one can test properties 
in sublinear time. For example, in [7], the authors focused their at- 
tention on testing various properties of graphs and other combinatorial 
objects. In particular, they considered the property of fc-colorability. 
This property is NP-complete to determine precisely but it is easily 
testable; more specifically, one can distinguish /c-colorable graphs from 
those that are e-far from /c-colorable in constant time. (Two graphs G 
and H onn vertices are e-close if at most en^ edges need to be modified 
(inserted or deleted) to turn G into H. Otherwise, G and H are e-far.) 

The work of [7] sparked a flurry of other results; in particular, an 
interesting line of work was initiated in [l], where the authors showed 
that the property of a graph being iJ-free (that is, the graph does not 
contain any copy oi H as a. subgraph) is easily testable for any constant 
sized graph H. 

In general, the area of property testing has been very active, with a 
number of property testers suggested for graphs and other combinato- 
rial objects, as well as matrices, strings, metric spaces, etc. 

In this paper, we discuss sublinear time property testing in the con- 
text of some particular problems in combinatorial theory of groups and 
semigroups. Testing some of these properties amounts to testing a 
graph (e.g. the Whitehead graph of a free group element), and there- 
fore fits in with the original ideas of sublinear time property testing 
that come from graph theory. To give an example, we describe here 
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a particular property of a free group element that can be tested in 
sublinear time in the length of the input element. 

3. Testing primitivity in a free group 

Let Fr be a free group of rank r > 2 with a fixed finite basis X = 
{xi, . . . , Xr}- An element g £ Fr is called primitive if it is a member of 
some free basis of Fr- Or, equivalently, if there is an automorphism of 
Fr that takes g to xi. 

A natural property of a given element u € Fr one might want to 
test is whether or not u is primitive. We show that for "most" inputs, 
this can be done in time sublinear in the length of u. We have to note 
one subtle distinction between what we are going to show here and 
what was established in [11]. Prom the results of [11], it follows that a 
"generic" element u € Fr is not primitive (moreover, its length cannot 
be decreased by any automorphism of Fr). However, these results are 
only applicable if u was chosen uniformly randomly from the set of all 
(freely reduced) words of length < N, for some A^. Furthermore, given 
a particular element u £ Fr, the results of [llj do not allow one to 
check (in linear time, say) that u is, indeed, non-primitive. 

What we are going to show here is that, after testing a small part 
of a cyclically reduced word u, one can, for a "generic" freely reduced 
u, tell for sure (i.e., with a rigorous proof) that u is not primitive. To 
explain this, we have to introduce the Whitehead graph first. 

The Whitehead graph Wh{u) of a (cyclically reduced) word u £ Fr 
is obtained as follows. The vertices of this graph correspond to the 
elements of the free generating set X and their inverses. For each 
occurrence of a subword the word u, there is an edge in Wh{u) 

that connects the vertex Xi to the vertex if u has a subword Xixj^, 
then there is an edge connecting Xi to xj, etc. There is one more edge 
(the external edge) included in the definition of the Whitehead graph: 
this is the edge that connects the vertex corresponding to the last letter 
of u to the vertex corresponding to the inverse of the first letter. 

It was observed by Whitehead himself (see also [20] ) that the White- 
head graph of any cyclically reduced primitive element of length > 2 
has either an isolated edge or a cut vertex, i.e., a vertex that, having 
been removed from the graph together with all incident edges, increases 
the number of connected components of the graph. Obviously, if the 
Whitehead graph has a Hamiltonian circuit (i.e., a circuit that contains 
all vertices of the graph), then it cannot have a cut vertex. Our test 
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is therefore pretty simple: pick a random subword v of u, of length 
sublinear in \u\, say, of length \u\^ for some < 6 < 1. It follows from 
results of that all possible 2-letter subwords are going to be present 
in V with overwhelming probability. Having checked that (which takes 
linear time in \v\, and therefore sublinear time in |m|), we conclude that 
the Whitehead graph of v is complete, hence the Whitehead graph of 
u has a Hamiltonian circuit, whence u is not primitive. 

We note, in passing, that the problem of detecting a Hamiltonian 
circuit in an arbitrary given graph is well known to be computationally 
hard (in fact, NP-complete) in the worst case [6], but it is also known 
to be easy for "most" graphs (it is even easy "on average", see [8]). 

It is an interesting question whether sublinear time algorithms can 
be found for other instances of the Whitehead problem (=automorphic 
conjugacy problem) in a free group, so we ask: 

Problem 1. Let v G be arbitrary but fixed. Is there a generic subset 
S (see our Section\^ Definition \4-S^ of Fr and an algorithm Av such 
that for any u £ S, the algorithm Av is able to detect, in time sublinear 
in the length of u, that u cannot be taken to v by any automorphism of 
F ? 

4. The word problem in semigroups 

If a group (or a semigroup) G is given by a recursive presentation in 
terms of generators and defining relators: 

G = {xi,X2, . . . ,Xn \ ri,r2, . . .), 
then the word problem for G is: given a word g = g{xi,X2, ■ ■ ■ , Xn), find 
out whether or not g = 1 in G. The word problem is known to have 
linear time solution for hyperbolic groups. 

As we have mentioned in the Introduction, it is fairly easy to show 
that testing sublinear-length subwords of a given word g cannot help 
in deciding whether or not g = 1 m a group G unless G is free and 
g is freely reduced. Indeed, suppose generators of G satisfy a relation 
r = r(xi, . . . , x„) = 1. Then, given a (freely reduced) word g of length 
m, the initial segment of g of length < ^ will have r as a subword 
with probability converging to 1 exponentially fast as m — >• oo. Since 
any cyclic shift of a word representing the identity also represents the 
identity, we may assume, without loss of generality, that our initial 
segment of g of length < ^ ends with r, i.e., it is of the form ur. Then, 
if g is of the form urr'u~^, where r' is any relator in G, it represents the 
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identity. Therefore, examining a subword of length < ^ of a generic 
word of length m cannot possibly help to guarantee that g ^ 1 in G. 

However, with semigroups the situation is different, so we address 
here the following, perhaps somewhat vague, problem: 

Problem 4.1. Are there natural examples of semigroups given by gen- 
erators and defining relators, where the word problem admits a sublinear 
time solution for "most" inputs? 

Note that the word problem for semigroups has a slightly different 
wording (excuse the pun): given two words g,h in generators of a 
semigroup G, find out whether or not g = h in G. Of course, if an 
algorithm for a sublinear time solution of the word problem exists, it 
will only give "negative" answers, i. e., g ^ h in G. This is similar 
to results of [9], where (generically) linear time solution of the word 
problem was offered for several large classes of groups; their solution, 
too, gives only "negative" answers. 

First we have to clarify the meaning of "most" inputs in this context. 
To that end, we recall the definition of a generic set from [9]. The 
most general and straightforward definition is based on the notion of 
asymptotic density. 

Definition 4.2. Suppose that T is a countable set and that £ : T — )• N 
is a function (referred to as length j such that for every n G N the 
set {x £ T : i{x) < n} is finite. If X T and n > 0, we denote 
pe{n,X) := #{x G X : i{x) < n} and -/iin,X) = E X : i{x) = 
n}. 

Let 5 C T. The asymptotic density of S in T is 

Pt,aS) := hmsup = limsup— — — , 

n->oo #{x G T : l{x) < n\ n^oo Pe{n,T) 

where we treat a fraction ^, if it occurs, as 0. 

// the actual limit exists, we denote it by pr/iS) and call this limit 
the strict asymptotic density of S iuT. We say that S is generic in T 
with respect to £ if pT,t{S) = 1. 

In our situation, T is the set of all words in a given (finite) alphabet 
X = {xi,X2, . . . , and (.{w), w £ T is the usual lexicographic length 
of w that we often denote simply by \w\. Thus, given a semigroup G 
generated by X, we are looking for a generic set S" C T of words 
such that for any g,h G S, there is a sublinear time in n = + 
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(probabilistic) algorithm proving that g ^ h m. G with probability 
1 — e(n), where e(n) — )• as n — )• oo. 

As we have pointed out in the Introduction, one potential source of 
semigroups with the property in question is "positive monoids" asso- 
ciated with groups, i.e., monoids generated by group generators, but 
not their inverses. For some particular groups, e.g. for braid groups, 
Thompson's group, these monoids have been extensively studied, and 
because of very nontrivial combinatorics involved in these studies, it 
would be quite interesting to either obtain a sublinear time algorithm 
for solving the word problem in these monoids or prove that none ex- 
ists. Negative results would be interesting, too, because lower bounds 
on complexity are always valuable. 

Another important class of positive monoids is associated with free 
nilpotent groups; these monoids have a special name of strictly nilpotent 
semigroups, see [19]. They are called strictly nilpotent because there 
are several other definitions of nilpotency for semigroups; for a survey 
on these and on how they are related to strictly nilpotent semigroups we 
refer to [19] or [18]. Here we just say that nilpotent semigroups, under 
various definitions, have been extensively studied from many different 
perspectives (see e.g. [12] or [T8]). 

In the following three subsections, we are going to show that of the 
three kinds of positive monoids (associated with free nilpotent groups, 
with Thompson's group F, and with braid groups), only those associ- 
ated with braid groups admit sublinear-time detecting of inequality at 
least for some pairs of words. 

4.1. Positive monoid of a free nilpotent group. Positive monoids 
of free nilpotent groups are called strictly nilpotent semigroups, see 
|19j . We have to give some background here because properties of free 
nilpotent groups are not as well known these days as properties of braid 
groups or Thompson's group are. 

Magnus [H] considered the embedding of the free group F with a 
free generator set X into the power series ring with the same set X of 
generators and proved that a group element which belongs to ^c{F), 
the cth term of the lower central series of the group F, is mapped to 
a power series without non-constant terms of degree less than c. The 
converse result (i.e., that any group element which does not belong to 
7c (-F), is mapped to a power series with some non-constant terms of 
degree less than c) appeared to be quite difficult to prove. Probably 
the first full and correct proof was given by Chen, Fox and Lyndon 
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in [5]. They considered the free group ring instead of the power series 
ring and proved that 

7c(F) = (A^ + 1) n F, 

where Aj? is the augmentation ideal of the free group ring ZF, i.e., the 
kernel of the natural "augmentation" homomorphism e p : ZF — ^ Z 
that takes all elements of F to 1. 

Now let Mc denote the positive monoid of the free nilpotent group 
F/'jc+i{F) of class c, where F is a free group of rank r > 2 with a free 
generator set X. We do not include the rank r in the notation because 
our results in this section are independent of r. We have: 

Lemma 1. Elements of Mc satisfy all identities of the form ac = be, 
where ac and be are words in X such that (ac — be) G A'^^-'^ . 

To prove the main result of this section, we will need to combine 
this lemma with the following result due to A. I. Mal'cev [15] and, 
independently, to B. Neumann and T. Taylor [16]. To better tailor (no 
pun intended) this result to our needs, we give it here in a weaker form. 

Lemma 2. Let 

Uo = X, Vo = y; Un+l = UnVn, ^n+l = VnUn, 

where x, y are arbitrary elements of a free group F . Then Uc = Vc 
modulo 7c+i(-F). 

By combining Lemma [Hand Lemma [21 we get 

Proposition 4.1. For any two positive words wi and W2 of length n in 
an alphabet X, there are positive words zi and Z2 of lengths < (n — 1)-2'^ 
such that wizi = W2Z2 in Mc- 

In particular, given two positive words of length L one cannot tell 
that they are not equal in Mc by just inspecting the prefixes of length 
< i.e., there is at least no obvious sublinear time algorithm for 
detecting inequality in M^. 

Proof. Construct the Mal'cev-Neumann- Taylor sequence of words start- 
ing with uq = wi, vq = W2- Then Uc has wi as a prefix, Vc has W2 as a 
prefix, Uc = Vc in Mc, and the length of both Uc and Vc is n ■ 2'^. □ 
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4.2. Positive monoid of Thompson's group F. Thompson's group 
F is well known in many areas of mathematics, including algebra, ge- 
ometry, and analysis. For a survey on various properties of Thompson's 
group, we refer to This group has the following nice presentation 
in terms of generators and defining relations: 

F = {xq, xi,X2,... I XkXi = XiXk+i [k > i)). 

Since all defining relators in this presentation are pairs of positive 
words, we can consider the positive monoid associated with this pre- 
sentation; denote it by F~^. 

We note that the above (infinite) presentation allows for a convenient 
normal form. We do not really need it in this paper, but we describe it 
here anyway. The classical normal form of an element of Thompson's 
group is a word of the form 

•^ll ■ ■ ■ -^ts-^jf ■ ■ ■ -^j^ 1 

such that the following two conditions are satisfied: 
(NFl) h < ... < is andii < . . . < jt 

(NF2) if both Xi and x~^ occur, then either Xj+i or occurs, too. 
Now we get to the point of this section. 

Proposition 4.2. For any two positive words wi and W2 of lengths 
m and n, respectively, in the alphabet X = {xq, xi, X2, . . .}, there are 
positive words zi and Z2 of lengths n and m, respectively, such that 
wizi = W2Z2 in Thompson's group F . 

The following elegant and simple proof is due to Victor Guba. 

Proof. Construct the following van Kampen diagram (see e.g. [13] for 
the definition of a van Kampen diagram). On a square lattice, mark 
one point as the origin. Starting at the origin and going to the right, 
write the word wi by marking edges of the lattice by the letters of wi, 
read left to right. Then, starting at the origin and going up, write the 
word W2 by marking edges of the lattice by the letters of W2 , read left 
to right. 

Now start marking edges of the lattice inside the rectangle built on 
segments of length m (horizontally) and n (vertically) corresponding 
to the words wi and W2, as follows. All horizontal edges in the lattice 
are directed from left to right, and all vertical edges are directed from 
bottom to top. Then, suppose a single square cell of the lattice has: 
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• Xi on the lower edge and Xi on the left edge. Then we mark 
the upper edge and the right edge of this cell with the same Xi. 
This cell now corresponds to the relation 

• Xi on the lower edge and Xj on the left edge, where i < j. 
Then we mark the upper edge of this cell with Xi, and the 
right edge with Xj+i. This cell now corresponds to the relation 

XjXiXj_^^Xj^ — 1, or XjXi — XiXj-^\. 

• Xi on the lower edge and Xj on the left edge, where i > j. 
Then we mark the upper edge of this cell with Xj+i, and the 
right edge with xj. This cell now corresponds to the relation 

j -^i — 1; Ol" -^j-^i+l — XiXj. 

After all edges of the rectangle built on segments corresponding 
to the words wi and W2 are marked, we read a relation of the form 
W2UiU2^wY^ = 1, or W2U1 = W1U2, off the edges of this rectangle. Here 
the length of ui is m and the length of U2 is n. This completes the 
proof. 

□ 

Example 1. Ifwi = xiX2 and W2 = x^x^, this method gives wix^xj = 
W2X1X2. 

Pr op osit ion 14 . 2 1 implies . in particular, that it is impossible to tell that 
two positive words of length L in the alphabet X = {xq, xi, X2, . . .} are 
not equal in Thompson's group F by inspecting their initial segments 
of length < i.e., there is at least no such straightforward sublinear 
time algorithm for detecting inequality in F"^ . 

4.3. Positive braid monoids. Braid groups need no introduction; we 
just refer to the monograph ^3] for background. Some notation has to 
be recalled though. We denote the braid group on Ti strands by Bn't 
this group has a standard presentation 

(cji, ...,cr„_i| aiUj = (TjUi if |z-j| > 1; o-jCJi+icrj = o-j+ifJifJi+i for 1 < i < n-2). 

We shall call elements of Bn braids, as opposed to hraid words that are 
elements of the ambient free group on ai, ...,(T„_i. 

Since all defining relators of a braid group are positive words, we can 
consider the positive braid monoid; denote it by B^. 

It turns out that, in contrast to the situation with positive monoids 
Mc and F'^ considered in two previous sections of this paper, for at 
least some pairs of positive words in i?+ there is a sublinear time test 
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for inequality. The following proposition follows from the results of [2]; 
in particular, from the proof of their Proposition 2.9. 

Proposition 4.3. Let wi = aia^ ■ ■ ■ a2m-i , W2 = o'2mO'2m-2 • • • <72- 
Suppose wiu = W2V for some u,v £ B^, n > 2m. Then \u\, \v\ = 2m^. 

Thus, in particular, if one has two positive braid words of length L, 
where one of them starts with aia^ ■ ■ ■ cr2fc_i, the other one starts with 
C2A;C2fc-2 • • • C25 and k > \/L, then these braid words are not equal in 
B+, n > 2k. 

Of course, this is just a very special example where a sublinear time 
algorithm can detect inequality of two words in B^, so the interesting 
question is whether examples of this sort are "generic". We therefore 
ask: 

Problem 2. Is there a generic subset S (in the sense of Definition 
\4-^ of B^ and a number e > such that for any two words wi , W2 of 
length k representing elements of S, the minimum length of words u, v 
such that wiu = W2V, is greater than /c(^+'^) ? 

Acknowledgement. The author is grateful to Victor Cuba and Patrick 
Dehornoy for helpful discussions. 
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